Communication aid for non-vocal people using corpusbased concatenative speech synthesis
نویسندگان
چکیده
This paper reports on the development of Chatako-AID, a communication aid for non-vocal people using corpus-based cocatenative speech synthesis by creating a speech corpus especially designed for such use. The concept of Chatako-AID; synthesis with the user’s voice, which makes use of precomposed texts, is highly appreciated by the target user. This confirms that the recording of a minimum set of phonetically balanced sentences is insufficient for speech synthesis in the proposed method and that a combination of the above recording and a recording of well-read continuous-text material produces more natural sounded synthesised speech.
منابع مشابه
A database design for a concatenative speech synthesis system for the disabled
This paper reports on our research on designing a speech corpora in Japanese for a concatenative speech synthesis system that is to be used for a specific purpose. For this work, the purpose was set to assist communication for non-vocal people. Four kinds of source database for synthesis were developed by combining different speech corpora created from read speech of an Amyotropic Lateral Scler...
متن کاملApplications of computer generated expressive speech for communication disorders
This paper focuses on generation of expressive speech, specifically speech displaying vocal affect. Generating speech with vocal affect is important for diagnosis, research, and remediation for children with autism and developmental language disorders. However, because vocal affect involves many acoustic factors working together in complex ways, it is unlikely that we will be able to generate c...
متن کاملDesign of English to Hindi Corpus Based Text Conversion and Hindi Text to Speech Synthesis
English is a global language but is understood by few percentage of population in India. It continues to remain a barrier for rural population to learn and compete at a global level. Machine translation helps people from different places to understand an unknown language without the aid of human translator. A Text to Speech system generatesspeech from text given as input. The proposed system wi...
متن کاملLocal minimum generation error criterion for hybrid HMM speech synthesis
This paper presents an HMM-driven hybrid speech synthesis approach in which unit selection concatenative synthesis is used to improve the quality of the statistical system using a Local Minimum Generation Error (LMGE) during the synthesis stage. The idea behind this approach is to combine the robustness due to HMMs with the naturalness of concatenated units. Unlike the conventional hybrid appro...
متن کاملRecent enhancements in CU VOCAL for Chinese TTS-enabled applications
CU VOCAL is a Cantonese text-to-speech (TTS) engine. We use a syllable-based concatenative synthesis approach to generate intelligible and natural synthesized speech [1]. This paper describes several recent enhancements in CU VOCAL. First, we have augmented the syllable unit selection strategy with a positional feature. This feature specifies the relative location of a syllable in a sentence an...
متن کامل